Bagging for robust non-linear multivariate calibration of spectroscopy
Abstract
This paper presents the application of the bagging technique to non-linear regression models to obtain more accurate and robust calibration of spectroscopy. Bagging combines multiple models, each fitted to a bootstrap sample drawn with replacement, into an ensemble model to reduce prediction errors. It is well suited to “non-robust” models, such as the non-linear calibration methods of artificial neural networks (ANN) and Gaussian process regression (GPR), in which small changes in the data or model parameters can result in significant changes in model predictions. A specific variant of bagging, based on sub-sampling without replacement and named subagging, is also investigated, since it has been reported to possess similar prediction capability to bagging while requiring less computation. However, this work shows that the calibration performance of subagging is sensitive to the amount of sub-sampled data, which needs to be determined by computationally intensive cross-validation. Therefore, we suggest that bagging is preferred to subagging in practice. An application study on two near-infrared datasets demonstrates the effectiveness of the presented approach.
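The difference between the two re-sampling schemes described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the base learner here is a 1-nearest-neighbour regressor chosen only as a simple stand-in for an unstable model (the paper uses ANN and GPR), and all function names, the ensemble size of 25, and the toy data are assumptions made for illustration.

```python
import random


def bootstrap_indices(n, rng):
    """Bagging: draw n indices WITH replacement (duplicates allowed)."""
    return [rng.randrange(n) for _ in range(n)]


def subsample_indices(n, m, rng):
    """Subagging: draw m distinct indices WITHOUT replacement."""
    return rng.sample(range(n), m)


def fit_1nn(X, y):
    """Hypothetical unstable base learner: 1-nearest-neighbour regression."""
    def predict(x):
        nearest = min(
            range(len(X)),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(X[i], x)),
        )
        return y[nearest]
    return predict


def fit_ensemble(X, y, n_models=25, subsample=None, seed=0):
    """Average base learners fitted on re-sampled data.

    subsample=None -> bagging (bootstrap with replacement)
    subsample=m    -> subagging with m points per model (m is the tuning
                      parameter the abstract says needs cross-validation)
    """
    rng = random.Random(seed)
    n = len(X)
    models = []
    for _ in range(n_models):
        if subsample is None:
            idx = bootstrap_indices(n, rng)
        else:
            idx = subsample_indices(n, subsample, rng)
        models.append(fit_1nn([X[i] for i in idx], [y[i] for i in idx]))

    def predict(x):
        # Ensemble prediction is the plain average of member predictions.
        return sum(model(x) for model in models) / len(models)
    return predict


# Toy one-dimensional data with noise (assumed, not from the paper).
_rng = random.Random(1)
X_toy = [[i / 10.0] for i in range(30)]
y_toy = [x[0] ** 2 + _rng.gauss(0.0, 0.2) for x in X_toy]

bagged = fit_ensemble(X_toy, y_toy)                  # bagging
subagged = fit_ensemble(X_toy, y_toy, subsample=15)  # subagging, m = n/2
print(bagged([1.55]), subagged([1.55]))
```

Bagging has no sample-size parameter to tune, whereas the `subsample` argument in this sketch corresponds to the quantity that the abstract reports must be chosen by cross-validation for subagging.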
Related resources
Multivariate calibration of near infrared spectroscopy in the presence of light scattering effect: a comparative study
When analyzing heterogeneous samples using spectroscopy, the light scattering effect introduces non-linearity into the measurements and deteriorates the prediction accuracy of conventional linear models. This paper compares the prediction performance of two categories of chemometric methods: pre-processing techniques to remove the non-linearity, and non-linear calibration techniques to directly...
Artificial neural networks as a multivariate calibration tool: modeling the Fe-Cr-Ni system in x-ray fluorescence spectroscopy
The performance of artificial neural networks (ANNs) for modeling the Cr-Ni-Fe system in quantitative x-ray fluorescence spectroscopy was compared with the classical Rasberry-Heinrich model and a previously published method applying the linear learning machine in combination with singular value decomposition. Apart from determining if ANNs were capable of modeling the desired non-linear relation...
Development of near infrared reflectance spectroscopy (NIRS) calibration model for estimation of oil content in a worldwide safflower germplasm collection
The key objective of this research project was the development of a NIRS calibration model as a rapid, precise, robust, and cost-effective method to estimate oil content in ground seeds of a worldwide safflower germplasm collection grown under different agro-climatic conditions. The oil content was measured by the accelerated solvent extraction method in a total of 328 samples collected across 2004 (16...
Determination of Protein and Moisture in Fishmeal by Near-Infrared Reflectance Spectroscopy and Multivariate Regression Based on Partial Least Squares
The potential of Near Infrared Reflectance Spectroscopy (NIRS) as a fast method to predict the Crude Protein (CP) and Moisture (M) content in fishmeal was evaluated by scanning spectra between 1000 and 2500 nm and applying a multivariate regression technique based on Partial Least Squares (PLS). The coefficient of determination in calibration (R2C) and Standard Error of Calibra...
Development of a Sensitive Spectrofluorometric-Multivariate Calibration Method for Enzyme Kinetic of Aldehyde Oxidase
Attempts to obtain experimental values for the kinetic parameters of phenanthridine oxidation by guinea pig or rabbit liver aldehyde oxidase using common spectrophotometric methods have not been successful due to a lower limit of detection. In the present study, a new spectrofluorimetric assay in combination with a multivariate calibration method for enzymatic kinetic study of aldehyde oxidase ...